DOCUMENT RESUME 



ED 366 196                                              FL 021 754

AUTHOR        Kennedy, Graeme D.
TITLE         Collocations: Where Grammar and Vocabulary Teaching Meet.
PUB DATE      90
NOTE          17p.; In Sarinee, Anivan, Ed. Language Teaching
              Methodology for the Nineties. Anthology Series 24;
              see FL 021 739.
PUB TYPE      Viewpoints (Opinion/Position Papers, Essays, etc.)
              (120) -- Speeches/Conference Papers (150)

EDRS PRICE    MF01/PC01 Plus Postage.
DESCRIPTORS   Computational Linguistics; Foreign Countries;
              *Grammar; *Language Patterns; *Language Processing;
              Language Research; Second Language Instruction;
              *Second Language Learning; *Vocabulary Development;
              Word Frequency
IDENTIFIERS   *Collocations



ABSTRACT

Traditionally, the study of language patterns has been viewed
primarily in terms of rules of grammar and discourse and of vocabulary
choice. Researchers are now exploring the nature of collocations, or
patterns of word sequence or co-occurrence in discourse. Most of the
attention has been focused on colorful collocations, not on more
ordinary usage. Computer analysis of large corpora now makes
description of patterns possible. An analysis of the use of four
English prepositions ("at, from, between, through") in collocation in
one large corpus of British English illustrates the potential of this
area of study. Results of the analysis indicate that the prepositions
have distinctive patterns of co-occurrence with different form classes
(e.g., nouns vs. verbs), and cannot be viewed or taught as relatively
interchangeable grammatical items. Some problems in interpreting and
using collocation analyses persist, such as judgments about
significance of word sequences as collocations, and the number of
words that can occur between elements of the collocation. However,
study of collocations may have implications for theories of language
learning, theories and models of language processing, content of
language instruction, and pedagogical practice. (MSE)



***********************************************************************
*   Reproductions supplied by EDRS are the best that can be made      *
*   from the original document.                                       *
***********************************************************************



COLLOCATIONS: WHERE GRAMMAR AND 
VOCABULARY TEACHING MEET 

GRAEME D. KENNEDY 






Language teachers are well aware that fashions or emphases change in their 
profession every few years. In the last decade or so, for example, there has been 
a focus at different times on the language learner, on the use of language, on 
authenticity of the spoken or written texts to which the learner is exposed, on 
interaction in the learning context, on communicative teaching, and on the 
teacher as an organizer of opportunities for learning. All of these have been 
important emphases. But there has also been, to the bewilderment of some 
language learners, an unwillingness by many teachers in recent years to focus on 
grammatical form or to analyse the units of the language being learned. 

As Sinclair (1985) has written, however, "absence of interest in what one is
teaching is surely a perilous condition". Perhaps not surprisingly, therefore,
there have recently been calls by applied linguists for a re-examination of the
role of grammar in language teaching. At the same time, while the future can
hardly be expected to lie in a sterile emphasis on teaching grammar and
vocabulary as an unapplied system, neither can language teaching be improved
simply by slogans such as 'Grammar is a good thing'. The purpose of this paper
is to suggest that text-based, pedagogically appropriate descriptions of
language need more emphasis as part of language teacher education, in that they
properly form part of methodology: they inform curriculum designers and
classroom teachers about how a language is put together, and they also throw
new light on what some of the units of learning might be. In this sense, more
emphasis on pedagogical grammar can complement the greater focus on
empirically-based instructional activities or learning tasks, a focus which
promises to be important in the years ahead (Crookes, 1986).

The growing availability of microcomputers has begun to make easier the 
analysis of texts and there are indications that it might be possible to reinterpret 
what constitutes grammar and vocabulary respectively and thus enhance our 
understanding of what it is we learn when we learn a language. I am referring, 
of course, to research on the company words tend to keep, the routines, set 
phrases or collocations we habitually use when we speak or write. 

The mainstream of both theoretical and applied linguistics has been
fascinated over the last two or three decades by the generative character of
language and especially its creative or innovative nature. Chomsky, for
example, who was probably the greatest single influence, made claims such as
the following:



We constantly read and hear new sequences of words, recognize them as 
sentences and understand them. It is easy to show that the new events that 
we accept and understand as sentences are not related to those with which 
we are familiar by any simple notion of formal (or semantic or statistical) 
similarity or identity of grammatical frame. (1959: 57) 

Chomsky was of course reacting against behaviourist models of learning
and especially against Skinnerian notions of verbal chaining. However, not
everyone would agree that novelty lies at the heart of language use, and we do
not have to go to Skinner for a statement to that effect. For example, that
celebrated sailor, novelist and learner of English as a second language, Joseph
Conrad, wrote in his great novel Nostromo:

The value of a sentence is in the personality which utters it, for nothing
new can be said by man or woman. (1904: 183)

The issue is then - Do we have largely open choice in rule-governed
grammatical frames in the words we use, or do we learn and use collocations to
a greater extent than is usually recognized? Although behaviourist models of
language learning no longer enjoy widespread currency, research on collocations
suggests that automaticity or habit formation from an information-processing or
skills perspective still has some explanatory power. The extent to which
collocations occur also suggests that it may be possible to teach some of what
has usually been considered as grammar in terms of vocabulary. Thus, for
example, at the present time can be considered from a grammatical viewpoint to
be a prepositional phrase, or it can be viewed as a lexicalized unit which is
often synonymous with the word now.

In a statement as well known as that quoted above, Chomsky (1965: 5) 
characterized so-called traditional grammars as being deficient in that they leave 
unexpressed many of "the basic regularities of the language with which they are 
concerned". 

Traditionally and conventionally, regularity in language has been seen
primarily in terms of rules of grammar (and discourse), and in vocabulary
choice. In the last decade, however, a number of researchers have explored the
nature of collocations as a particular type of regularity - the occurrence of
particular sequences of words in language use by first and second language
learners.

Papers by Krashen and Scarcella (1978), Nattinger (1980), Pawley and
Syder (1983), Peters (1980) and Sinclair (1987) are among many which have
summarized research on collocations, and most recently there have been
dictionaries which record or take account of collocations (Benson et al, 1986;
Sinclair et al, 1987).

Regrettably there is something of a forest of terminology, much of which 
overlaps. Researchers have often used different terms, many of which are 
synonymous, for collocation. These include the following (cf. Becker, 1975): 

prefabricated routines       (how are you)
prefabricated patterns       (that's a ___)
sentence builders            (that's a ___)
unassimilated fragments      (to meet you - as a greeting)
formulaic speech             (as a matter of fact)
idioms                       (kick the bucket)
cliches                      (as a matter of fact)
lexicalized sentence stems   (as a matter of fact)
non-canonical forms          (on with the show)
polywords                    (the powder room)
phrasal constraints          (by pure coincidence)
deictic locutions            (as a matter of fact)
situational utterances       (I'm glad to meet you)
verbatim texts               (oozing charm from every pore)
fixed phrases                (in brief; at the present time)
set phrases                  (in brief; at the present time)

Sometimes, the term "patterned speech" has been used to include all the
above. Since it is not the purpose of the present paper to discuss the various
varieties of patterned speech, the word collocation is used here to include any
recurring sequences of words. Suffice to say that whereas some researchers such
as Krashen and Scarcella deny that collocations constitute "a large part of
language", other researchers such as Pawley, Nattinger and Sinclair have argued
that they are overwhelmingly pervasive.

In the research literature, the focus has been on the learning and use in
discourse of what are often colourful collocations such as those illustrated.
However, little attention has been paid to less striking but no less pervasive
patterning throughout the grammar. Yet if the theory of collocation is to work,
it has to work at the less striking, more mundane level. For example, English
prepositions are considered to be hard to learn and teach, yet ten or twelve
prepositions constitute about 10% of any spoken or written text. Computer
analysis of large corpora makes possible the description of patterning and
indeed shows that it exists to a striking extent at the level of the
prepositional phrase. The remainder of this paper presents data from a
computer-assisted analysis of the use of four English prepositions, AT, FROM,
BETWEEN and THROUGH - part of a study of the ten most frequent prepositions in
the LOB (Lancaster-Oslo-Bergen) corpus (Johansson et al, 1978).

The LOB corpus is a 1-million-word representative sample of adult written
British English. It is made up of 500 samples, each of 2,000 words, from a wide
variety of genres. Although the texts in the LOB corpus are now almost 25 years
old, it is one of the most accessible databases for computer-assisted analysis.
While the language changes constantly, it is likely that prepositional usage
is more stable than content word usage.

There are about 6000 occurrences of AT in the one-million-word LOB
corpus. That is 0.6% of the words, or about one AT in every 167 words. FROM is
slightly less frequent, occurring about once in every 216 words. BETWEEN
occurs about once in 1,164 words, while THROUGH occurs about once in 1,314
words. It is not difficult to find patterning in the use of the prepositions
AT, FROM, BETWEEN and THROUGH in the corpus. For example, Table 1 is a
rank ordering of the 142 collocations beginning with AT which occur four or
more times. They total 2,575 tokens, thus accounting for 43% of the uses of AT
in the corpus. Close examination of Table 1 shows that a few collocations
occurred with very high frequency; others, marked with an asterisk, probably
re- [...] Tate Gallery); still others, while apparently formulaic, did not
occur very often (eg at the most occurred only four times).
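A rank ordering of the kind shown in Table 1 can be approximated with a few lines of code. The following is a minimal sketch only, not the procedure used in the study: the function name `right_collocations`, the `span` parameter and the toy sample text are all assumptions made for illustration, and the sketch ignores sentence boundaries and part-of-speech information.

```python
import re
from collections import Counter


def right_collocations(text, node, span=1, min_count=4):
    """Count word sequences that start with `node` and extend up to
    `span` words to its right; keep those seen at least `min_count` times.

    Hypothetical helper for illustration; tokenisation is deliberately crude.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter()
    for i, token in enumerate(tokens):
        if token != node:
            continue
        for n in range(1, span + 1):
            if i + n < len(tokens):
                counts[" ".join(tokens[i:i + n + 1])] += 1
    # most_common() returns sequences in descending order of frequency
    return [(seq, n) for seq, n in counts.most_common() if n >= min_count]


# Toy text (invented), not data from the LOB corpus.
sample = "At least he came at once. " * 4 + "She waited at the door."
for sequence, tokens_found in right_collocations(sample, "at"):
    print(sequence, tokens_found)
```

The threshold of four tokens mirrors the cut-off used for Table 1; with a real corpus the same loop would simply run over many more tokens.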

A further 932 tokens of AT occurred before the names of towns, institutions
or events (eg at Ascot) but because none of these individual place names
occurred four or more times, they are not listed in Table 1. Similarly, there
were 236 tokens in the corpus of AT followed by a personal pronoun (eg at her,
at him). If the names of towns, institutions or events and the various personal
pronouns are treated as two types of collocations (AT + (THE) + PROPER NOUN
(PLACE) and AT + PERSONAL PRONOUN) then the total number of collocations
beginning with AT occurring four or more times as listed in Table 1 would be
3,743, or 63% of the tokens in the corpus.
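The arithmetic behind these coverage figures can be checked directly. The sketch below uses only numbers quoted in the text; the variable names are illustrative, and `AT_TOKENS` is the rounded "about 6000" figure, so the combined share comes out at roughly 62% rather than exactly the 63% reported (which would follow from a slightly smaller exact count of AT).

```python
CORPUS_WORDS = 1_000_000   # size of the LOB corpus as stated in the text
AT_TOKENS = 6_000          # "about 6000": a rounded figure, not an exact count

listed_in_table_1 = 2_575  # tokens of the 142 collocations in Table 1
before_place_names = 932   # AT + (THE) + proper noun
before_pronouns = 236      # AT + personal pronoun

combined = listed_in_table_1 + before_place_names + before_pronouns


def share(part, whole):
    """Return `part` as a percentage of `whole`."""
    return 100 * part / whole


print(combined)                                   # 3743
print(round(share(listed_in_table_1, AT_TOKENS)))  # 43
print(round(share(combined, AT_TOKENS), 1))        # 62.4
print(round(CORPUS_WORDS / AT_TOKENS))             # 167 words per AT
```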

Thus, in a single table, almost two-thirds of the collocations beginning with
AT in a representative sample of written British English can be indicated. As
Table 1 shows, at least was the most frequent collocation, while others of less
frequency such as at the tailplane may not be formulaic at all. Such a table may
be of use to course designers in checking the coverage of materials for
language teaching, but is probably not of major theoretical interest.

Table 1 Right collocations of AT arranged in order of frequency
(* = marked with an asterisk in the original; token counts are not
recoverable from the source scan)

at least
at all
at last
at once
at the same time
at the end (of the)
at home
at the time
*at which
at present
at first
at any rate
at night
at the moment (of)
at the top
at times
at the beginning (of)
at this time
at work
at the meeting (of)
at that time
at the age of
at the back (of)
at any time
at the bottom (of)
at the present time
at about
at the expense of
at school
at this stage
at this point
at one time
at a point
at length
at the head of
*at the same
at the side (of)
*at the door
at a time
at a time when
*at Cambridge
*at what
at the point (of)
*at the University
at dinner
at that moment
at (clause final)
at hand
at large
at that
at the foot (of)
at the start
at the surface
*at various
at random
at sea
at the front (of)
at ease
at first sight
at all times
at a cost of
at intervals
at the office
at the rate (of)
at this moment
*at London Airport
at the table
at the weekend
at the centre (of)
at the corner (of)
at one end (of)
at the heart of
*at the total
*at a temperature (of)
*at a meeting (of)
at any moment
at best
at dawn
*at his desk
at rest
at stake
*at technical colleges
at the edge (of)
at the sound (of)
at the thought (of)
*at Manchester
*at Oxford
*at Covent Garden
*at Christmas
at the turn of
*at the school
at the wheel
at the worst
*at the India Office
*at the July meeting
*at the church
at the close of
at the cost of
at the far end
at the first
*at the gate
at the rear (of)
at heart
at [illegible]
at right angles
at a later date
at a rate of
at a later stage
at a loss
at all costs
at all levels
at arm's length
*at around
at college
at each other
at fault
at high temperatures
at low temperatures
*at his feet
at its best
at long last
at midnight
at peace
at the base (of)
*at the dance
*at the election
*at the hospital
*at the home
at the last moment
at the most
*at the level of
at the ready
at the root (of)
*at the other
*at Eton
at the way
*at the tailplane
*at the Foreign Office
*at the Tate Gallery
*at universities
at will
at one point
at one

It is, of course, possible to provide similar tables for each of the other
prepositions. In this paper, however, it will be of more value to compare the
four prepositions with regard to the left and right collocations they are
associated with. Such a comparison shows that to treat these prepositions
grammatically as roughly substitutable parts of speech can be very misleading.
Yet most grammars of English do assume that English prepositions behave in a
similar fashion, differing mainly in their so-called locative meanings.

Tables 2 and 3 compare the right and left collocations of the four
prepositions. The rank orderings of the words which occur most frequently
before and after the four prepositions are not strictly comparable, because the
preposition AT, for example, is much more frequent than BETWEEN or THROUGH, and
therefore the actual numbers of tokens of the collocations in each category are
themselves not strictly comparable. To assist comparisons, therefore, a line is
drawn across each column at approximately the point where a collocation occurs
once in every 200 instances (or 0.5%) of that preposition. It is immediately
apparent, for example, in Table 2, that whereas AT occurs in twenty right
collocations which have a frequency greater than 0.5%, FROM has only three right
collocations with comparable frequency, and only from time to time among these
seems lexicalized. AT collocates strongly with certain preceding and following
words, whereas BETWEEN and THROUGH tend to collocate most strongly
with preceding words, as a comparison of Tables 2 and 3 shows.
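The 0.5% line drawn across each column amounts to a relative-frequency cut-off that scales with the frequency of the preposition itself. A minimal sketch follows, assuming the rounded total of 6000 tokens of AT and using only three counts that are quoted in the running text (at once 98, at the turn of 5, at the most 4); the function name `above_line` is hypothetical.

```python
def above_line(collocation_counts, preposition_total, threshold=0.005):
    """Return the collocations that account for at least `threshold`
    (0.5% by default) of all tokens of the preposition."""
    cutoff = preposition_total * threshold
    return {
        collocation: count
        for collocation, count in collocation_counts.items()
        if count >= cutoff
    }


# Counts quoted in the text; the AT total is the rounded "about 6000".
at_counts = {"at once": 98, "at the turn of": 5, "at the most": 4}
print(above_line(at_counts, preposition_total=6000))
```

Because the cut-off is proportional, a collocation needs about 30 tokens to clear the line for AT, but far fewer for a less frequent preposition such as BETWEEN or THROUGH.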

A particularly striking point to note in Table 3 is that the prepositions can
differ markedly not only in the particular lexical items which precede or follow
them, but also in the parts of speech which the collocating items represent.
Thus, as Table 3 shows, the most frequent words immediately preceding
BETWEEN are nouns (eg difference, relationship). The most frequent words
preceding THROUGH are typically verbs (eg go, pass, come).

The evidence for these four prepositions suggests that they cannot be taught
as grammatical items which can be substituted for each other, differing only in
the basic locative meaning in each case.

In fact, the basic locative meanings of AT, FROM, BETWEEN and 
THROUGH do not notably stand out in the most frequent collocations which 
these four prepositions form part of. In English language teaching, however, it is 
the basic locative meanings which normally constitute the main pedagogical 
focus. 

Text-based descriptions of the company kept by individual prepositions can
also indicate the relative frequency of recurrent patterns of words and this
should influence the work of curriculum designers and classroom teachers. For
example, the basic locative use of AT followed by a noun which is part of
something occurs 281 times in the LOB corpus (about 5% of the occurrences of
AT). These are listed alphabetically in Table 4. However, not all are equally
likely to occur, as Table 4 shows.



Table 2 Comparison of rank ordering of right collocations

[Table body not recoverable from the source scan.]



Table 3 Comparison of rank ordering of left collocations

[Table body not recoverable from the source scan.]



Table 4 AT + THE + noun which is part of something

             No. tokens
back             22
base              4
bottom       [illegible]
centre       [illegible]
corner       [illegible]
door             14
edge         [illegible]
end              88
foot             10
front             9
head             15
heart             7
point            12
rear              5
side             14
surface          10
top              21

Total           281



Similarly, Table 5 shows what is perhaps really a commonsense patterning
in the rank ordering of the occurrence of personal pronouns after the four
prepositions, but one which shows that BETWEEN behaves somewhat differently
from the other three, in that plural pronouns are most frequent after
BETWEEN.

Table 5 Rank ordering of occurrences of personal pronouns following
AT, FROM, BETWEEN and THROUGH

AT            FROM          BETWEEN       THROUGH
him    67     it     29     them   36     it     12
her    58     him    28     us     13     him     7
me     41     her    18     her     5     them    6
it     39     them   16     him     4     her     2
them   15     me     15     you     3     me      1
you    10     you     4     it      3     you     1
us      6     us      3     me      1     us      0
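The contrast described for BETWEEN can be made explicit by computing, for each preposition, the share of following pronouns that are plural. The sketch below transcribes the counts of Table 5; treating only them and us as plural (and leaving you aside, since its number is ambiguous) is an assumption made here for illustration, not a classification stated in the paper.

```python
# Counts as read from Table 5.
TABLE_5 = {
    "AT": {"him": 67, "her": 58, "me": 41, "it": 39, "them": 15, "you": 10, "us": 6},
    "FROM": {"it": 29, "him": 28, "her": 18, "them": 16, "me": 15, "you": 4, "us": 3},
    "BETWEEN": {"them": 36, "us": 13, "her": 5, "him": 4, "you": 3, "it": 3, "me": 1},
    "THROUGH": {"it": 12, "him": 7, "them": 6, "her": 2, "me": 1, "you": 1, "us": 0},
}

# Assumption: "you" is not counted as plural because its number is ambiguous.
PLURAL = {"them", "us"}


def plural_share(counts):
    """Proportion of pronoun tokens that are unambiguously plural."""
    total = sum(counts.values())
    return sum(n for pronoun, n in counts.items() if pronoun in PLURAL) / total


for preposition, counts in TABLE_5.items():
    print(preposition, sum(counts.values()), f"{plural_share(counts):.1%}")
```

The AT column sums to 236, which agrees with the number of tokens of AT followed by a personal pronoun reported earlier in the paper.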



The data in Table 6 shows quite striking differences in the part of speech
likely to occur immediately before each of the four words. THROUGH, for
example, shows verbs as the most frequent category, whereas the other three
show nominals as the most frequent, most strikingly so in the case of
BETWEEN. FROM is less likely than the other words to begin a sentence or
clause, although as Table 2 shows FROM, BETWEEN and THROUGH often
end a sentence or clause.



Table 6 Parts of speech occurring immediately before
AT, FROM, BETWEEN and THROUGH

                                  % of tokens
                      AT      FROM    BETWEEN   THROUGH
Nouns or pronouns    41.6     45.0     66.2      28.7
Verbs                31.6     29.3     16.2      44.0
Adjectives            3.1      4.8      1.7       3.4
Other P.O.S.         16.7     17.2     10.1      15.0
Clause initial        7.0      3.7      5.7       8.9
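The pattern in Table 6 can be read off mechanically by taking the largest category in each column. The following sketch transcribes the percentages from the table; the dictionary layout and category labels are illustrative choices.

```python
# Percentages as given in Table 6 (category immediately before each word).
TABLE_6 = {
    "AT": {"noun or pronoun": 41.6, "verb": 31.6, "adjective": 3.1,
           "other": 16.7, "clause initial": 7.0},
    "FROM": {"noun or pronoun": 45.0, "verb": 29.3, "adjective": 4.8,
             "other": 17.2, "clause initial": 3.7},
    "BETWEEN": {"noun or pronoun": 66.2, "verb": 16.2, "adjective": 1.7,
                "other": 10.1, "clause initial": 5.7},
    "THROUGH": {"noun or pronoun": 28.7, "verb": 44.0, "adjective": 3.4,
                "other": 15.0, "clause initial": 8.9},
}

for preposition, distribution in TABLE_6.items():
    # The category with the highest percentage in this column.
    most_frequent = max(distribution, key=distribution.get)
    print(preposition, most_frequent, distribution[most_frequent])
```

Each column sums to 100% within rounding, which is a useful check that the table has been transcribed correctly.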



In spite of the information which can be found by studying collocations in
corpora, there are nevertheless some major problems in interpreting and using
such information as is found in Tables 1-5. First, while there are some word
sequences which we can be confident are lexicalized as a single unit (eg at the
moment), there are other sequences which, while occurring reasonably
frequently, do not have such a strong sense of belonging together (eg from the
outside). On the other hand, there are others which occur in a particular
corpus perhaps only once or twice, yet are recognized by users of the language
as familiar or formulaic. Table 7 contains some such examples of collocations
with AT.

Table 7 Collocations with AT which occur infrequently in the LOB corpus
(each occurs between one and five times; the row alignment of the token
counts is not recoverable from the source scan)

is not to be sneezed at
there is no chance at all
in no time at all
some at least of
for me at any rate
none at all
love at first sight
if at all
make yourself at home
what you are driving at
it was really no problem at all
what on earth was he playing at
near at hand
what is at stake
he was upset at being
yet, at the same time,
significant at the n% level

Without psycholinguistic research, it is of course not possible to make valid
judgements about which word sequences are significant as collocations and
which are not.

Second, some collocations can be discontinuous and therefore the study of
occurring adjacent sequences alone is not enough to get a picture of how
frequent a particular collocation really is. In the following sentence from the
LOB corpus, for example, six words come between different and from.

Non-cooperators were not different in age or other environmental factor
from the rest.

In the corpus, the word different occurs 364 times. On 21 occasions, it is
immediately followed by from; on another eight occasions different has one
intervening word before from; on two occasions there are two intervening words;
once each there are three or four intervening words; and twice there are six.
On 329 occasions, different is not followed by from at all.

Examination of discontinuous collocations suggests that a search of up to
about five places either side of a key word is necessary to get a reasonably
accurate picture of the frequency of a particular collocation. Simple computer
programmes which identify a key word or node in context typically highlight
words immediately adjacent to the right or left of the key word. It is also
possible, however, to get the programmes to identify discontinuous collocations
in text.

Even more striking than the possible discontinuity in collocations is the
fundamental issue of the different functions of formally identical collocations.
Consider the collocation at the turn of in Table 1. It is shown as occurring
five times. These tokens were as follows:

1. at the turn of a knob
2. at the turn of the stairs
3. at the turn of the path
4. at the turn of the century
5. at the turn of Leo's key.

Semantically these have little in common. In context, the first is an
adverbial of manner. The second and third are locative, while the last two are
temporal.
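The search for discontinuous collocations discussed above, in which a collocate is looked for within a window of words around the key word, can be sketched as follows. This is an illustration under stated assumptions rather than the programme used in the study: the function name `gaps_to_collocate` and the `max_gap` parameter are hypothetical, the sketch looks only to the right of the node, and tokenisation is crude.

```python
import re


def gaps_to_collocate(text, node, collocate, max_gap=5):
    """For each token of `node`, report how many words intervene before
    the first `collocate` to its right, provided that number does not
    exceed `max_gap`."""
    tokens = re.findall(r"[a-z']+", text.lower())
    gaps = []
    for i, token in enumerate(tokens):
        if token != node:
            continue
        # The window holds the next max_gap + 1 words after the node.
        window = tokens[i + 1:i + 2 + max_gap]
        if collocate in window:
            gaps.append(window.index(collocate))
    return gaps


sentence = ("Non-cooperators were not different in age or other "
            "environmental factor from the rest.")
print(gaps_to_collocate(sentence, "different", "from", max_gap=5))
print(gaps_to_collocate(sentence, "different", "from", max_gap=6))

# Distribution reported in the text: intervening words -> number of tokens.
reported = {0: 21, 1: 8, 2: 2, 3: 1, 4: 1, 6: 2}
print(sum(reported.values()), 364 - sum(reported.values()))
```

With the LOB sentence quoted in the text, a window allowing five intervening words misses the collocation, while one allowing six finds it, which shows how sensitive the resulting frequency is to the window size chosen.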

Similarly, at once occurs 98 times in Table 1. Close examination of the
collocations in context, however, shows that there are two quite different
functions.

1. immediately (eg I replied at once) 

2. simultaneously (eg I can't be everywhere at once). 

In the LOB corpus, 89 out of the 98 tokens of at once mean immediately, 
and the remaining nine are used to mean simultaneously. 

Collocations, of course, are frequently made up of more than two words. 
As noted above, FROM is immediately preceded by different on 21 occasions. In 
the case of fifteen of these occurrences, there is a preceding quantificational 
word showing a tendency to hyperbole, as Table 8 shows. 



Table 8 Words which precede different from in the LOB corpus

                                  No. of tokens
very different from                     3
so different from                       3
fundamentally different from            2
little different from                   1
too different from                      1
completely different from               1
significantly different from            1
totally different from                  1
utterly different from                  1
essentially different from              1

A similar tendency to hyperbole is seen with support from, which occurs 9
times. The words which precede it include influential and utmost.

A further example of how statistical information on collocations might
reveal the dimensions of the language learner's task can be seen in the
adjectives which typically precede each of the four prepositions discussed in
this paper. Table 9 contains the examples which occurred two or more times.
Not only are the adjectives or quantifiers almost entirely different, but there
are also striking differences in the actual numbers of adjectives which occur
before each preposition. Available and far are the only adjectives in the table
which precede more than one of the prepositions.

It should be clear, then, that computer-based analysis of text can provide
striking, often previously unknown information about the way a language fits
together - something which is not grammar in the sense usually used by
linguists, because collocational studies go beyond systemic possibility by
adding a statistical aspect, an aspect based on actual use.

The data described in this paper is of course indicative rather than
comprehensive, and ways of exploiting such information for teaching are not yet
clear. It can be seen nevertheless that some items that have usually been
treated pedagogically from a grammatical perspective can be treated more as
vocabulary. For teaching, there are a number of possibilities. In terms of
approach, experiential teaching is established as important for the teaching of
both grammar and vocabulary. Interactional activities requiring, for example,
the matching of collocations with glosses are consistent with communicative
language teaching procedures. Cloze exercises, which are often used for both
vocabulary and grammar teaching, can encompass collocations - the focus being
on both form and meaning.
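A cloze item built around a collocation can be generated very simply. The sketch below is one possible illustration, not a procedure proposed in the paper: the function name `make_cloze`, the choice to blank the first word of the collocation, and the blank string are all assumptions.

```python
import re


def make_cloze(sentence, collocation, blank="____"):
    """Blank out the first word of `collocation` where it occurs in
    `sentence`; return the gapped sentence and the missing word, or
    None if the collocation is not found."""
    pattern = re.compile(r"\b" + re.escape(collocation) + r"\b", re.IGNORECASE)
    match = pattern.search(sentence)
    if match is None:
        return None
    first_word_length = len(collocation.split()[0])
    found = match.group(0)
    gapped = (sentence[:match.start()] + blank
              + found[first_word_length:] + sentence[match.end():])
    return gapped, found[:first_word_length]


print(make_cloze("I replied at once.", "at once"))
print(make_cloze("I can't be everywhere at once.", "at once"))
```

Because the remainder of the collocation stays visible, the learner is prompted by the lexical company the preposition keeps rather than by a locative rule.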

Reading activities can also be important for learning collocations. Texts for
reading are often selected or modified on orthodox vocabulary grounds and
there is typically some gradation or sequencing of grammar teaching. Systematic
exposure to the most frequent lexicalized collocations could be another
criterion.

There is another approach to the learning and teaching of prepositions
which needs considering in light of the data I have described. If little of the
richness and complexity of English prepositional use is captured by teaching
prepositions as grammar, perhaps they should not be taught at all, but rather
left to be learned through language experience, recognizing nevertheless that
experiential learning, while natural, is not necessarily time efficient. That
is a question which can be resolved only by more research into the effects of
different pedagogical practices.

Table 9 Adjective-preposition collocations
(columns in the original: -AT, -FROM, -BETWEEN, -THROUGH; the column
alignment is not recoverable from the source scan, so entries are given in
scan order, each followed by its number of tokens)

present 10; far 50; good 10; different 21; more 8; free 21; available 5;
absent 11; old 5; remote 8; active 4; safe 5; alone 4; clear 5; high 4;
distinct 5; open 4; apparent 4; significant 4; exempt 4; hard 3;
effective 3; little 3; evident 3; outstanding 3; forthcoming 3; possible 3;
fresh 3; straight 3; immune 3; useful 3; isolated 3; aghast 2; available 2;
agreed 2; attractive 2; alarmed 2; best 2; brown 2; distant 2; cheap 2;
distinguishable 2; clear 2; important 2; indistinguishable 2; mad 2;
necessary 2; due 2; repayable 2; inseparable 2; sad 2; familiar 2;
strong 2; obvious 2; uncomfortable 2; latest 2; usual 2; necessary 2;
warm 2

What text-based collocational studies do suggest is that the description of
grammar is, from the teacher's point of view, an essential part of methodology,
but it needs to be based on more than the orthodox grammatical and lexical
description. Just as the teacher of botany does not take students into the
jungle and expect them to learn about all the plants by simply being exposed to
them, so the language curriculum designer and classroom teacher can facilitate
learning by systematic presentation of the role of important language items and
their linguistic ecology - the company words keep.

Whether we learn and use prepositions as parts of collocations or routines
rather than as grammatical devices differing only on semantic grounds cannot of
course be resolved on the basis of the data I have described. But we can be
sure that there are more regularities in prepositional use than it has hitherto
been possible to demonstrate, and that habit formation as part of language
learning need not be inconsistent with post-behaviourist learning models. The
study of collocations may thus have implications for our theories of language
learning and for theories and models of language processing, as well as for the
content of language teaching syllabuses, and pedagogical practices.

REFERENCES

BECKER, J D. 1975. 'The Phrasal Lexicon' in B Nash-Webber and R Schank (eds)
    Theoretical Issues in Natural Language Processing, 1. Cambridge, Mass.:
    Bolt, Beranek and Newman.

BENSON, M; E Benson and R Ilson. 1986. The BBI Combinatory Dictionary of
    English. Amsterdam: John Benjamins.

CHOMSKY, N. 1959. Review of Skinner's Verbal Behavior. Language 35, 1: 26-58.

________. 1965. Aspects of the Theory of Syntax. Cambridge, Mass.: MIT Press.

CONRAD, J. 1904. Nostromo. Harmondsworth: Penguin Books.

CROOKES, G. 1986. Task Classification: A Cross-Disciplinary Review. Technical
    Report No. 4, Department of English as a Second Language. Honolulu:
    University of Hawaii.

JOHANSSON, S; G N Leech and H Goodluck. 1978. Manual of Information to
    Accompany the Lancaster-Oslo/Bergen Corpus of British English, for Use
    with Digital Computers. University of Oslo.

KRASHEN, S and R Scarcella. 1978. 'On Routines and Patterns in Language
    Acquisition and Performance' Language Learning 28, 2: 283-300.

NATTINGER, J R. 1980. 'A Lexical Phrase Grammar for ESL' TESOL Quarterly,
    14, 3: 337-344.

PAWLEY, A and F H Syder. 1983. 'Two Puzzles for Linguistic Theory: Native-like
    Selection and Native-like Fluency' in J C Richards and R W Schmidt (eds)
    Language and Communication. Longman.

PETERS, A M. 1980. 'The Units of Language Acquisition' University of Hawaii
    Working Papers in Linguistics, 12, 1: 1-72.

SINCLAIR, J McH. 1985. 'Selected Issues' in R Quirk and H G Widdowson (eds)
    English in the World. Cambridge: Cambridge University Press.

________. 1987. 'Collocation: a Progress Report' in R Steele and T Threadgold
    (eds) Language Topics: Essays in Honour of Michael Halliday. Vol 2.
    Amsterdam: John Benjamins.

________ et al. 1987. The Collins Cobuild English Language Dictionary. London:
    Collins.



